Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Search: Advanced Search
Anonymous User (login or join us)
Upload

Reply to this post | See parent post | Go Back
View Post [edit]

Poster: aibek Date: Jan 25, 2014 2:31am
Forum: forums Subject: Re: CDX digest not accurately capturing duplicates?

You are right. Wayback Machine server records more than one digest for the same file.

The gif image queried for has apparently stayed the same for the last 13 years. On the linked CDX query page, the digests for almost all the records are the same (7WCS…). But some records have another digest (ISYU…). The latter files, however, are exactly the same as the former files.

Either there is a bug (the bug, however, works in a consistent manner!), or Zarkoff and I have misunderstood what the digest represents.

http://web.archive.org/cdx/search/cdx?url=google.com/intl/en/images/logo.gif&collapse=timestamp:10